Advanced Statistical Models for Software Data

نویسندگان

  • Giancarlo Succi
  • Milorad Stefanovic
  • Witold Pedrycz
چکیده

In this paper, we provide a framework for investigation and quantification of impact of object-oriented design choices on the defects in software systems. We report the initial results of an extensive case study, which strongly reinforce earlier, mainly anecdotal, evidence that design aspects related to inheritance and communication between classes can be used as indicators of the most defect-prone classes. To deal with specifics of software metrics data, statistical models applicable for the non-normally distributed count data are used, such as Poisson regression, negative binomial regression, and zero-inflated negative binomial regression. Alberg diagrams are applied to assess the models’ ability to identify the most critical classes in the system. The zero-inflated negative binomial regression model, designed to explicitly model the occurrence of zero counts in the dataset, shows the best ability to describe the high variability in the dependent variable.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Thermal conductivity of Water-based nanofluids: Prediction and comparison of models using machine learning

Statistical methods, and especially machine learning, have been increasingly used in nanofluid modeling. This paper presents some of the interesting and applicable methods for thermal conductivity prediction and compares them with each other according to results and errors that are defined. The thermal conductivity of nanofluids increases with the volume fraction and temperature. Machine learni...

متن کامل

Thermal conductivity of Water-based nanofluids: Prediction and comparison of models using machine learning

Statistical methods, and especially machine learning, have been increasingly used in nanofluid modeling. This paper presents some of the interesting and applicable methods for thermal conductivity prediction and compares them with each other according to results and errors that are defined. The thermal conductivity of nanofluids increases with the volume fraction and temperature. Machine learni...

متن کامل

Development of Software Tools for Ecological Field Studies Using ArcPad

Integration of data collection, statistical analysis and dynamic modeling in ecology requires new hardware and software tools that offer mobile data management together with implementation of advanced calculation methods. Recent advances in mobile computing representing by mobile devices and ESRI’s ArcPad make possible to automate a number of procedures. In addition to mapping of spatial and te...

متن کامل

Prediction of Rainfall under HadCM3 and CanESM2 Climate Change Models using Statistical Downscaling Model (Case Study: Tabriz Synoptic Station)

Global climate change as a main factor affecting all ecological components, has been attended by researchers all over the world in the recent years. In this regard for simulating the rainfall, National Centers for Environmental Prediction (NCEP) data, HadCM3 data under A2 and B2 scenarios, CanESM2 data under RCP2.6, RCP4.5 and RCP8.5 scenarios were utilized. This research was performed by adopt...

متن کامل

Exact Mixed Integer Programming for Integrated Scheduling and Process Planning in Flexible Environment

This paper presented a mixed integer programming for integrated scheduling and process planning. The presented process plan included some orders with precedence relations similar to Multiple Traveling Salesman Problem (MTSP), which was categorized as an NP-hard problem. These types of problems are also called advanced planning because of simultaneously determining the appropriate sequence and m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001